Smart Paper Notes

Blind Restoration of Real-World Audio by 1D Operational GANs

Turker Ince , Serkan Kiranyaz , Ozer Can Devecioglu , Muhammad Salman Khan , Muhammad Chowdhury , Moncef Gabbouj

Category: Machine Learning

2022-12-30

Objective: Despite numerous studies proposed for audio restoration in the literature, most of them focus on an isolated restoration problem such as denoising or dereverberation, ignoring other artifacts. Moreover, assuming a noisy or reverberant environment with a limited number of fixed signal-to-distortion ratio (SDR) levels is a common practice. However, real-world audio is often corrupted by a blend of artifacts such as reverberation, sensor noise, and background audio mixture with varying types, severities, and durations. In this study, we propose a novel approach for blind restoration of real-world audio signals by Operational Generative Adversarial Networks (Op-GANs) with temporal and spectral objective metrics to enhance the quality of the restored audio signal regardless of the type and severity of each artifact corrupting it. Methods: 1D Operational-GANs are used with a generative neuron model optimized for blind restoration of any corrupted audio signal. Results: The proposed approach has been evaluated extensively over the benchmark TIMIT-RAR (speech) and GTZAN-RAR (non-speech) datasets corrupted with a random blend of artifacts, each with a random severity, to mimic real-world audio signals. Average SDR improvements of over 7.2 dB and 4.9 dB are achieved, respectively, which are substantial when compared with the baseline methods. Significance: This is a pioneering study in blind audio restoration with the unique capability of direct (time-domain) restoration of real-world audio whilst achieving an unprecedented level of performance for a wide SDR range and artifact types. Conclusion: 1D Op-GANs can achieve robust and computationally effective real-world audio restoration with significantly improved performance. The source codes and the generated real-world audio datasets are shared publicly with the research community in a dedicated GitHub repository.
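
The SDR improvement quoted in the abstract can be illustrated with a minimal sketch. It assumes the common energy-ratio definition, SDR = 10 log10(||s||^2 / ||s - s_hat||^2); the abstract does not say which SDR variant the authors use, and the function names and toy signals below are hypothetical.

```python
import math


def sdr_db(reference, estimate):
    """SDR in dB, assuming the plain energy-ratio definition (an assumption)."""
    if len(reference) != len(estimate):
        raise ValueError("signals must have the same length")
    signal_energy = sum(s * s for s in reference)
    if signal_energy == 0:
        raise ValueError("reference signal has zero energy")
    distortion_energy = sum((s - e) ** 2 for s, e in zip(reference, estimate))
    if distortion_energy == 0:
        return math.inf  # the estimate matches the reference exactly
    return 10.0 * math.log10(signal_energy / distortion_energy)


def sdr_improvement_db(clean, corrupted, restored):
    """How many dB the restoration gains over the corrupted input."""
    return sdr_db(clean, restored) - sdr_db(clean, corrupted)


# Toy data (hypothetical): a sine wave with a constant offset as the "artifact".
clean = [math.sin(2 * math.pi * 5 * n / 100) for n in range(100)]
corrupted = [s + 0.5 for s in clean]
restored = [s + 0.05 for s in clean]
gain = sdr_improvement_db(clean, corrupted, restored)
print(f"SDR improvement: {gain:.2f} dB")
```

In this toy case the residual error shrinks by a factor of 10 in amplitude, so the distortion energy drops by a factor of 100 and the improvement is 20 dB.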

Deep Learning based Automatic Quantification of Urethral Plate Quality using the Plate Objective Scoring Tool (POST)

Tariq O. Abbas , Mohamed AbdelMoniem , Ibrahim Khalil , Md Sakib Abrar Hossain , Muhammad E. H. Chowdhury

Category: Computer Vision | Artificial Intelligence

2022-09-28

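Objectives: To explore the capacity of deep learning algorithms to further streamline and optimize urethral plate (UP) quality appraisal using the Plate Objective Scoring Tool (POST), with the aim of increasing the objectivity and reproducibility of UP appraisal in hypospadias repair. Methods: Five key POST landmarks were marked by specialists in a dataset of 691 images from prepubertal boys undergoing primary hypospadias repair. This dataset was then used to develop and validate a deep learning-based landmark detection model. The proposed framework begins with glans localization and detection, where the input image is cropped using the predicted bounding box. Next, a deep convolutional neural network (CNN) architecture is used to predict the coordinates of the five POST landmarks. These predicted landmarks are then used to assess UP quality in distal hypospadias. Results: The proposed model accurately localized the glans area, with a mean average precision (mAP) of 99.5% and an overall sensitivity of 99.1%. In predicting the landmark coordinates, it achieved a normalized mean error (NME) of 0.07152, a mean squared error (MSE) of 0.001, and a failure rate of 20.2% at an NME threshold of 0.1. Conclusions: This deep learning application shows robustness and high precision when using POST to appraise UP quality. Further evaluation using an international multicenter image-based database is to follow. External validation could benefit deep learning algorithms and lead to better assessment, decision-making, and prediction of surgical outcomes.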
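
The landmark metrics reported above (NME and the failure rate at an NME threshold of 0.1) can be sketched as follows. The normalizing length is an assumption: the abstract does not say whether errors are normalized by a bounding-box size, an inter-landmark distance, or something else. All names and coordinates below are hypothetical.

```python
import math


def normalized_mean_error(pred, truth, norm_length):
    """Mean Euclidean landmark error divided by a normalizing length.

    norm_length is an assumed normalizer (for example a bounding-box
    diagonal); the abstract does not specify which one is used.
    """
    dists = [math.dist(p, t) for p, t in zip(pred, truth)]
    return sum(dists) / (len(dists) * norm_length)


def failure_rate(nme_values, threshold=0.1):
    """Fraction of images whose NME exceeds the threshold."""
    return sum(1 for v in nme_values if v > threshold) / len(nme_values)


# Two toy images with five landmarks each (hypothetical coordinates).
truth = [(10, 10), (20, 10), (30, 10), (15, 30), (25, 30)]
pred_a = [(x + 3, y + 4) for x, y in truth]    # every landmark off by 5 px
pred_b = [(x + 9, y + 12) for x, y in truth]   # every landmark off by 15 px

nme_a = normalized_mean_error(pred_a, truth, norm_length=100.0)
nme_b = normalized_mean_error(pred_b, truth, norm_length=100.0)
rate = failure_rate([nme_a, nme_b], threshold=0.1)
print(nme_a, nme_b, rate)
```

Here the first image passes the 0.1 threshold and the second fails, which gives a failure rate of one half.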

An Approach of Adjusting the Switch Probability based on Dimension Size: A Case Study for Performance Improvement of the Flower Pollination Algorithm

Tahsin Aziz , Tashreef Muhammad , Md. Rashedul Karim Chowdhury , Mohammad Shafiul Alam

Category: Neural and Evolutionary Computing

2022-08-20

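Nature has inspired many metaheuristic algorithms, and their number has grown steadily over the past few decades. Most of these algorithms try to mimic biological and physical phenomena found in nature. This study focuses on the Flower Pollination Algorithm, one of several bio-inspired algorithms. The original method was proposed for exploration and exploitation by pollen grains in a confined space, using specific global pollination and local pollination strategies. As a "swarm" metaheuristic algorithm, its strength lies in locating the neighborhood of the optimal solution rather than identifying the minimum itself. This work details a modification to the original method. The study found that replacing the fixed "switch probability" with dynamic values that depend on dimension size and function mostly improved the results over the original Flower Pollination Algorithm.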
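
To make the role of the switch probability concrete, here is a minimal sketch of a standard Flower Pollination Algorithm in which the switch probability is chosen from the problem dimension. The schedule in `switch_probability` is purely hypothetical and is not the rule from the paper, which the abstract does not give; the step scale 0.1, the Lévy exponent 1.5, and the sphere test function are also assumptions.

```python
import math
import random


def switch_probability(dim):
    # HYPOTHETICAL schedule, not taken from the paper: start at 0.8 and
    # lower the chance of global pollination as the dimension grows.
    return max(0.5, 0.8 - 0.005 * max(0, dim - 10))


def levy_step(rng, beta=1.5):
    # Mantegna's algorithm for a Levy-distributed step length.
    sigma = (math.gamma(1 + beta) * math.sin(math.pi * beta / 2)
             / (math.gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2))) ** (1 / beta)
    u = rng.gauss(0.0, sigma)
    v = rng.gauss(0.0, 1.0)
    while v == 0.0:  # avoid a division by zero
        v = rng.gauss(0.0, 1.0)
    return u / abs(v) ** (1 / beta)


def sphere(x):
    return sum(v * v for v in x)


def flower_pollination(objective, dim, bounds=(-5.0, 5.0),
                       n_flowers=20, iterations=200, seed=0):
    rng = random.Random(seed)
    lo, hi = bounds
    p = switch_probability(dim)
    flowers = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_flowers)]
    fitness = [objective(f) for f in flowers]
    best_idx = min(range(n_flowers), key=fitness.__getitem__)
    best, best_fit = flowers[best_idx][:], fitness[best_idx]
    for _ in range(iterations):
        for i in range(n_flowers):
            x = flowers[i]
            if rng.random() < p:
                # Global pollination: Levy flight toward the best flower.
                step = 0.1 * levy_step(rng)
                cand = [xi + step * (bi - xi) for xi, bi in zip(x, best)]
            else:
                # Local pollination: mix two randomly chosen flowers.
                j, k = rng.sample(range(n_flowers), 2)
                eps = rng.random()
                cand = [xi + eps * (a - b)
                        for xi, a, b in zip(x, flowers[j], flowers[k])]
            cand = [min(hi, max(lo, c)) for c in cand]  # keep inside bounds
            cand_fit = objective(cand)
            if cand_fit < fitness[i]:  # greedy replacement
                flowers[i], fitness[i] = cand, cand_fit
                if cand_fit < best_fit:
                    best, best_fit = cand[:], cand_fit
    return best, best_fit


best, best_fit = flower_pollination(sphere, dim=5)
print(best_fit)
```

The sketch follows the published pseudo-code convention in which a random draw below the switch probability triggers global pollination. Swapping in a different `switch_probability` rule is the only change needed to try another dimension-dependent schedule.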

BIO-CXRNET: A Robust Multimodal Stacking Machine Learning Technique for Mortality Risk Prediction of COVID-19 Patients using Chest X-Ray Images and Clinical Data

Tawsifur Rahman , Muhammad E. H. Chowdhury , Amith Khandakar , Zaid Bin Mahbub , Md Sakib Abrar Hossain , Abraham Alhatou , Eynas Abdalla , Sreekumar Muthiyal , Khandaker Farzana Islam , Saad Bin Abul Kashem

Category: Computer Vision | Machine Learning

2022-06-15

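Fast and accurate detection of a disease can greatly help to reduce the strain on any country's healthcare facilities and to lower mortality during a pandemic. The goal of this work is to create a multimodal system, using a novel machine learning framework, that uses both chest X-ray (CXR) images and clinical data to predict severity in COVID-19 patients. In addition, the study presents a nomogram-based scoring technique for predicting the likelihood of death in high-risk patients. The study uses 25 biomarkers and CXR images to predict risk in 930 COVID-19 patients from the first wave of COVID-19 in Italy (March to June 2020). The proposed multimodal stacking technique achieved a precision, sensitivity, and F1-score of 89.03%, 90.44%, and 89.03%, respectively, in identifying low-risk and high-risk patients. This multimodal approach improved accuracy by 6% compared with using CXR images or clinical data alone. Finally, a nomogram scoring system based on multivariate logistic regression was used to stratify the mortality risk of the high-risk patients identified in the first stage. Lactate dehydrogenase (LDH), O2 percentage, white blood cell (WBC) count, age, and C-reactive protein (CRP) were identified as useful predictors using a random forest feature selection model. A nomogram score based on these five predictors and the CXR images was developed to quantify the probability of death and to assign patients to two risk groups: survival (<50%) and death (>=50%). The multimodal technique predicted the death probability of high-risk patients with an F1-score of 92.88%. The areas under the curve for the development and validation cohorts were 0.981 and 0.939, respectively.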

A Shallow U-Net Architecture for Reliably Predicting Blood Pressure (BP) from Photoplethysmogram (PPG) and Electrocardiogram (ECG) Signals

Sakib Mahmud , Nabil Ibtehaz , Amith Khandakar , Anas Tahir , Tawsifur Rahman , Khandaker Reajul Islam , Md Shafayet Hossain , M. Sohel Rahman , Mohammad Tariqul Islam , Muhammad E. H. Chowdhury

Category: Machine Learning

2021-11-12

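Cardiovascular diseases are the most common cause of death worldwide. Detecting and treating heart-related diseases requires continuous blood pressure (BP) monitoring along with many other parameters. Several invasive and non-invasive methods have been developed for this purpose. Most existing methods used in hospitals for continuous BP monitoring are invasive. By contrast, cuff-based BP monitoring methods, which can predict systolic blood pressure (SBP) and diastolic blood pressure (DBP), cannot be used for continuous monitoring. Several studies have attempted to predict BP from signals that can be collected non-invasively, such as the photoplethysmogram (PPG) and electrocardiogram (ECG), which can be used for continuous monitoring. In this study, we explore the applicability of autoencoders for predicting BP from PPG and ECG signals. The investigation was carried out on 12,000 instances from the MIMIC-II dataset, and it was found that a very shallow, one-dimensional autoencoder can extract the relevant features to predict SBP and DBP with state-of-the-art performance on a very large dataset. An independent test set drawn from a portion of the MIMIC-II dataset gave MAEs of 2.333 and 0.713 for SBP and DBP, respectively. On an external dataset of 40 subjects, the model trained on the MIMIC-II dataset gave MAEs of 2.728 and 1.166 for SBP and DBP, respectively. In both cases, the results met British Hypertension Society (BHS) Grade A and surpassed the studies in the current literature.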
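
The two evaluation criteria named above, MAE and the BHS grade, can be sketched in a few lines. The grade table uses the standard BHS cumulative-error thresholds (percentage of readings within 5, 10, and 15 mmHg); the function names and the toy readings are hypothetical and are not taken from the paper.

```python
def mean_absolute_error(pred, truth):
    return sum(abs(p - t) for p, t in zip(pred, truth)) / len(truth)


# BHS grading: minimum cumulative percentage of absolute errors
# within 5, 10 and 15 mmHg for each grade.
BHS_GRADES = [("A", 60, 85, 95), ("B", 50, 75, 90), ("C", 40, 65, 85)]


def bhs_grade(pred, truth):
    errors = [abs(p - t) for p, t in zip(pred, truth)]
    n = len(errors)
    pct = [100.0 * sum(e <= limit for e in errors) / n for limit in (5, 10, 15)]
    for grade, *required in BHS_GRADES:
        if all(p >= r for p, r in zip(pct, required)):
            return grade
    return "D"  # worse than grade C


# Toy SBP readings in mmHg (hypothetical values).
truth = [120, 118, 125, 130, 110]
pred = [122, 117, 131, 129, 112]
mae = mean_absolute_error(pred, truth)
grade = bhs_grade(pred, truth)
print(mae, grade)
```

With these toy values the absolute errors are 2, 1, 6, 1, and 2 mmHg, so 80% of readings fall within 5 mmHg and all fall within 10 mmHg, which satisfies the Grade A thresholds.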

EDITH: ECG biometrics aided by Deep learning for reliable Individual auTHentication

Nabil Ibtehaz , Muhammad E. H. Chowdhury , Amith Khandakar , Serkan Kiranyaz , M. Sohel Rahman , Anas Tahir , Yazan Qiblawey , Tawsifur Rahman

Category: Machine Learning | Artificial Intelligence

2021-02-16

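In recent years, authentication based on physiological signals has shown great promise because of its inherent robustness against forgery. The electrocardiogram (ECG) signal, the most widely studied biosignal, has also received the most attention in this regard. Numerous studies have shown that individuals can be identified with acceptable accuracy by analyzing their ECG signals. In this work, we present EDITH, a deep learning-based framework for an ECG biometric authentication system. Moreover, we hypothesize and demonstrate that Siamese architectures can be used in place of typical distance metrics to improve performance. We evaluated EDITH on 4 commonly used datasets, and it outperformed prior work while using fewer beats. EDITH performs competitively using just a single heartbeat (96-99.75% accuracy) and can be further improved by fusing multiple beats (100% accuracy with 3 to 6 beats). Furthermore, the proposed Siamese architecture reduces the identity verification equal error rate (EER) to 1.29%. A limited case study of EDITH on real-world experimental data also suggests its potential as a practical authentication system.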
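
The equal error rate mentioned above is the operating point where the false acceptance rate equals the false rejection rate. The sketch below estimates it from two lists of distances (smaller means more similar), sweeping each observed distance as a candidate threshold. It does not model the Siamese network itself; the function name and the toy distances are hypothetical.

```python
def equal_error_rate(genuine, impostor):
    """Approximate EER from same-person and different-person distances."""
    best_gap, eer = None, None
    for thr in sorted(set(genuine) | set(impostor)):
        # Impostor pairs accepted because their distance is under the threshold.
        far = sum(d <= thr for d in impostor) / len(impostor)
        # Genuine pairs rejected because their distance is over the threshold.
        frr = sum(d > thr for d in genuine) / len(genuine)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Toy distances (hypothetical): one genuine pair and one impostor pair overlap.
genuine = [0.1, 0.2, 0.3, 0.6]
impostor = [0.4, 0.7, 0.8, 0.9]
eer = equal_error_rate(genuine, impostor)
print(eer)
```

At a threshold of 0.4 one of four impostor pairs is accepted and one of four genuine pairs is rejected, so both error rates meet at 0.25.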

PPG2ABP: Translating Photoplethysmogram (PPG) Signals to Arterial Blood Pressure (ABP) Waveforms using Fully Convolutional Neural Networks

Nabil Ibtehaz , Sakib Mahmud , Muhammad E. H. Chowdhury , Amith Khandakar , Mohamed Arselene Ayari , Anas Tahir , M. Sohel Rahman

Category: Machine Learning

2020-05-04

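Cardiovascular diseases are among the most serious causes of mortality, taking a heavy toll of lives around the world every year. Continuous monitoring of blood pressure appears to be the most viable option, but it requires an invasive process that brings several layers of complexity. This motivates us to develop a method for predicting the continuous arterial blood pressure (ABP) waveform non-invasively, using photoplethysmogram (PPG) signals. In addition, we explore the advantage of deep learning: by making handcrafted feature computation irrelevant, it frees us from relying only on ideally shaped PPG signals, which is a shortcoming of existing approaches. We therefore present PPG2ABP, a deep learning-based method that predicts the continuous ABP waveform from the input PPG signal with a mean absolute error of 4.604 mmHg while preserving shape, magnitude, and phase in unison. The more striking success of PPG2ABP, however, is that the DBP, MAP, and SBP values computed from the predicted ABP waveform outperform existing work under several metrics, even though PPG2ABP is not explicitly trained to do so.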
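
As a rough illustration of how SBP, DBP, and MAP can be read off a predicted ABP waveform, the sketch below takes the maximum, minimum, and mean of one waveform window. This is an assumption about the general idea only; the abstract does not describe the authors' exact procedure, and the names and sample values are hypothetical.

```python
def bp_values(abp_waveform):
    """Return (SBP, DBP, MAP) for one window of an ABP waveform in mmHg.

    Assumes SBP is the window maximum, DBP the window minimum, and MAP the
    window mean, which is a simplification for illustration.
    """
    sbp = max(abp_waveform)
    dbp = min(abp_waveform)
    mean_ap = sum(abp_waveform) / len(abp_waveform)
    return sbp, dbp, mean_ap


def mean_absolute_error(pred, truth):
    return sum(abs(p - t) for p, t in zip(pred, truth)) / len(truth)


# Toy reference and predicted waveforms (hypothetical samples, mmHg).
reference = [80, 100, 120, 100, 80, 90]
predicted = [82, 97, 118, 103, 79, 91]

waveform_mae = mean_absolute_error(predicted, reference)
sbp, dbp, mean_ap = bp_values(predicted)
print(waveform_mae, sbp, dbp, mean_ap)
```

The waveform error and the derived values are separate measurements: a model can be scored on the sample-by-sample MAE while SBP, DBP, and MAP are compared afterwards as by-products of the predicted waveform.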

Detecting Severity of Diabetic Retinopathy from Fundus Images using Ensembled Transformers

Chandranath Adak , Tejas Karkera , Soumi Chattopadhyay , Muhammad Saqib

Category: Computer Vision | Artificial Intelligence

2023-01-03

Diabetic Retinopathy (DR) is considered one of the primary concerns due to its effect on vision loss among most people with diabetes globally. The severity of DR is mostly comprehended manually by ophthalmologists from fundus photography-based retina images. This paper deals with an automated understanding of the severity stages of DR. In the literature, researchers have focused on this automation using traditional machine learning-based algorithms and convolutional architectures. However, the past works hardly focused on essential parts of the retinal image to improve the model performance. In this paper, we adopt transformer-based learning models to capture the crucial features of retinal images to understand DR severity better. We work with ensembling image transformers, where we adopt four models, namely ViT (Vision Transformer), BEiT (Bidirectional Encoder representation for image Transformer), CaiT (Class-Attention in Image Transformers), and DeiT (Data efficient image Transformers), to infer the degree of DR severity from fundus photographs. For experiments, we used the publicly available APTOS-2019 blindness detection dataset, where the performances of the transformer-based models were quite encouraging.

Internet of Things: Digital Footprints Carry A Device Identity

Rajarshi Roy Chowdhury , Azam Che Idris , Pg Emeroylariffion Abas

Category: Machine Learning

2023-01-01

The usage of technologically advanced devices has seen a boom in many domains, including education, automation, and healthcare, with most of the services requiring Internet connectivity. To secure a network, device identification plays a key role. In this paper, a device fingerprinting (DFP) model, which is able to distinguish between Internet of Things (IoT) and non-IoT devices, as well as uniquely identify individual devices, has been proposed. Four statistical features have been extracted from five consecutive device-originated packets to generate individual device fingerprints. The method has been evaluated using the Random Forest (RF) classifier and different datasets. Experimental results have shown that the proposed method achieves up to 99.8% accuracy in distinguishing between IoT and non-IoT devices and over 97.6% in classifying individual devices. These results indicate that the proposed method is useful in assisting operators in making their networks more secure and robust to security breaches and unauthorized access.

Floods Relevancy and Identification of Location from Twitter Posts using NLP Techniques

Muhammad Suleman , Muhammad Asif , Tayyab Zamir , Ayaz Mehmood , Jebran Khan , Nasir Ahmad , Kashif Ahmad

Category: Natural Language Processing

2023-01-01

This paper presents our solutions for the MediaEval 2022 task on DisasterMM. The task is composed of two subtasks, namely (i) Relevance Classification of Twitter Posts (RCTP), and (ii) Location Extraction from Twitter Texts (LETT). The RCTP subtask aims at differentiating flood-related and non-relevant social posts, while LETT is a Named Entity Recognition (NER) task that aims at extracting location information from the text. For RCTP, we proposed four different solutions based on BERT, RoBERTa, DistilBERT, and ALBERT, obtaining F1-scores of 0.7934, 0.7970, 0.7613, and 0.7924, respectively. For LETT, we used three models, namely BERT, RoBERTa, and DistilBERT, obtaining F1-scores of 0.6256, 0.6744, and 0.6723, respectively.
